Active Tracking System with Rapid Eye Movement Involving Simultaneous Top-down and Bottom-up Attention Control
نویسندگان
چکیده
Visual tracking of complex objects has been studied extensively in the field of robot vision, visual servoing, and surveillance. Detection of reliable low level visual features has been crucial, in the literature, for the stability of tracking in cluttered scene. However, using only low level features for tracking tends to be illusory for practical vision systems, and the selection of reliable visual features is still an important unsolved issue. In recent studies it has been stressed the importance of high-level features in guiding control for long-duration tracking (Matsugu et al., 2006; Li et al., 2007; Yang et al., 2007), computer vision (Sun & Fisher, 2003), and visual search (Lee et al., 2005). Recent cognitive neuropsychological as well as brain imaging studies also revealed a role of top-down attention in visual search task in human vision system (Patel & Sathian, 2000; Hopfinger et al., 2000; Corbetta & Shulman, 2002; Navalpakkam & Itti, 2006). In this chapter, we present a new object tracking vision system that incorporates both topdown and bottom-up attention processes. Tracking a specific object using both low-level and high-level features is not new and was previously studied (Isard & Blake, 1998) in a stochastic framework. The proposed active vision system in this chapter is task-oriented and tracks a specific object (i.e., person). The tracking process is initiated by the top-down process which is activated by robust object detection module (Matsugu & Cardon, 2004; Matsugu et al., 2004). Subsequent feature selection for tracking involves simultaneous consolidation mechanism between higher level complex features (e.g., face) obtained from the top-down process and low level features from the bottom-up process (detailed description is in Section 3). The active vision system controls eye movement based on prediction of an attended object’s location by the above processes. Main contribution of this chapter is that we propose a stable object tracking algorithm involving selective attention together with FF and FB hybrid control that enable smooth and saccadic pursuit. Specifically, we introduce a coherency measure of tracking features. Tracking using such measure ensures stability and fast recovery from failure (missing the object to be tracked) by way of consolidation among bottom-up and top-down attention cues. For the bottom-up feature-based prediction of tracked object, we use local color histogram, and histogram intersection (Swain & Ballard, 1991; Birchfield & Rangarajan, 2005) is used for feature matching. High-level, top-down feature for tracking is defined as detected face O pe n A cc es s D at ab as e w w w .ite ch on lin e. co m
منابع مشابه
روش ردیابی چشم در تعامل انسان رایانه، بررسی فرایند تعامل برپایه داده های حرکات چشم
Nowadays most of the day today services we receive are based upon computer systems. Services such as information searching or online shopping are considered among the most frequent online information systems' services. Users assess and process the information they receive from information systems. The theory of mind information processing asserts that humans process and analyze the information ...
متن کاملOn The Bottom-Up and Top-Down Influences of Eye Movements
To cope with the enormous amount of visual information in our everyday environment, the human visual system uses a mechanism of visual attention and saccadic eye movements to filter and process only the relevant information. In this study, we try to analyze and model the control of these eye movements. Eye movements are controlled by bottom-up and topdown mechanisms. The role of these two mecha...
متن کاملThe Effect of Bottom-up/Top- down Techniques on Lower vs. Upper -Intermediate EFL Learners’ Listening Comprehension
Listening is regarded as an interactive process involving decoding of information. This study was launched to find out the impact of bottom-up (BU) and top-down (TD) techniques on Iranian lower and upper intermediate learners’ listening comprehension. We selected a total of 120 participants in six intact classes, three lower intermediate and three upper intermediate. The proficiency level of th...
متن کاملTop-down Attention Supports Visual Loop Closing
In this paper, we present a method to improve the loop closing behaviour for visual SLAM. Landmarks consist of a combination of attention regions and Harris-Laplace corners. The attention regions are detected by a visual attention system which combines image-based, bottom-up and target-related, topdown information. The ability to perform target-directed search is used to search for expected lan...
متن کاملGoal Directed Visual Search Based on Color Cues: Co-operative Effects of Top-down & Bottom-up Visual Attention
Focus of attention plays an important part in our perception of the world around us. Visual search is a combined effort of the top-down (cognitive cue) and bottom-up (low-level feature conspicuity) processes. Often during visual search our attention involuntarily gets directed to some irrelevant conspicuous objects, such as a bright object, regardless of the cued object. Objects that share simi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012